Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 3533892 |
| Missing cells | 549641 |
| Missing cells (%) | 0.6% |
| Duplicate rows | 424 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 674.0 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Categorical | 13 |
|---|---|
| Numeric | 8 |
| Text | 3 |
activity_year has constant value "" | Constant |
| Dataset has 424 (< 0.1%) duplicate rows | Duplicates |
mortgage_term is highly imbalanced (70.9%) | Imbalance |
loan_outcome is highly imbalanced (56.3%) | Imbalance |
property_value_ratio has 155646 (4.4%) missing values | Missing |
combined_loan_to_value_ratio has 188622 (5.3%) missing values | Missing |
metro_name has 131511 (3.7%) missing values | Missing |
income is highly skewed (γ1 = 636.5541623) | Skewed |
loan_amount is highly skewed (γ1 = 1324.67877) | Skewed |
property_value_ratio is highly skewed (γ1 = 1604.288336) | Skewed |
combined_loan_to_value_ratio is highly skewed (γ1 = 1654.312096) | Skewed |
Reproduction
| Analysis started | 2024-04-08 16:02:15.766564 |
|---|---|
| Analysis finished | 2024-04-08 16:07:09.256306 |
| Duration | 4 minutes and 53.49 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
race
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| White | |
|---|---|
| Latino | |
| Race NA | |
| Black | |
| Asian | |
| Other values (2) | 22936 |
Length
| Max length | 16 |
|---|---|
| Median length | 5 |
| Mean length | 5.4068254 |
| Min length | 5 |
Characters and Unicode
| Total characters | 19107137 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | White |
| 3rd row | White |
| 4th row | White |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| White | 2206416 | |
| Latino | 442455 | 12.5% |
| Race NA | 379947 | 10.8% |
| Black | 252901 | 7.2% |
| Asian | 229237 | 6.5% |
| Native American | 16968 | 0.5% |
| Pacific Islander | 5968 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 2206416 | |
| latino | 442455 | 11.2% |
| race | 379947 | 9.7% |
| na | 379947 | 9.7% |
| black | 252901 | 6.4% |
| asian | 229237 | 5.8% |
| native | 16968 | 0.4% |
| american | 16968 | 0.4% |
| pacific | 5968 | 0.2% |
| islander | 5968 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2923980 | |
| t | 2665839 | |
| e | 2626267 | |
| W | 2206416 | |
| h | 2206416 | |
| a | 1350412 | |
| n | 694628 | 3.6% |
| c | 661752 | 3.5% |
| A | 626152 | 3.3% |
| L | 442455 | 2.3% |
| Other values (15) | 2702820 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14387532 | |
| Uppercase Letter | 4316722 | 22.6% |
| Space Separator | 402883 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2923980 | |
| t | 2665839 | |
| e | 2626267 | |
| h | 2206416 | |
| a | 1350412 | |
| n | 694628 | 4.8% |
| c | 661752 | 4.6% |
| o | 442455 | 3.1% |
| l | 258869 | 1.8% |
| k | 252901 | 1.8% |
| Other values (6) | 304013 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 2206416 | |
| A | 626152 | 14.5% |
| L | 442455 | 10.2% |
| N | 396915 | 9.2% |
| R | 379947 | 8.8% |
| B | 252901 | 5.9% |
| P | 5968 | 0.1% |
| I | 5968 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 402883 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18704254 | |
| Common | 402883 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2923980 | |
| t | 2665839 | |
| e | 2626267 | |
| W | 2206416 | |
| h | 2206416 | |
| a | 1350412 | |
| n | 694628 | 3.7% |
| c | 661752 | 3.5% |
| A | 626152 | 3.3% |
| L | 442455 | 2.4% |
| Other values (14) | 2299937 |
Common
| Value | Count | Frequency (%) |
| 402883 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19107137 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2923980 | |
| t | 2665839 | |
| e | 2626267 | |
| W | 2206416 | |
| h | 2206416 | |
| a | 1350412 | |
| n | 694628 | 3.6% |
| c | 661752 | 3.5% |
| A | 626152 | 3.3% |
| L | 442455 | 2.3% |
| Other values (15) | 2702820 |
sex
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| Male | |
|---|---|
| Female | |
| NA | 206101 |
| Marked both | 1552 |
Length
| Max length | 11 |
|---|---|
| Median length | 4 |
| Mean length | 4.5672629 |
| Min length | 2 |
Characters and Unicode
| Total characters | 16140214 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Male | 2123247 | |
| Female | 1202992 | |
| NA | 206101 | 5.8% |
| Marked both | 1552 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 2123247 | |
| female | 1202992 | |
| na | 206101 | 5.8% |
| marked | 1552 | < 0.1% |
| both | 1552 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4530783 | |
| a | 3327791 | |
| l | 3326239 | |
| M | 2124799 | |
| F | 1202992 | 7.5% |
| m | 1202992 | 7.5% |
| N | 206101 | 1.3% |
| A | 206101 | 1.3% |
| r | 1552 | < 0.1% |
| k | 1552 | < 0.1% |
| Other values (6) | 9312 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12398669 | |
| Uppercase Letter | 3739993 | 23.2% |
| Space Separator | 1552 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4530783 | |
| a | 3327791 | |
| l | 3326239 | |
| m | 1202992 | 9.7% |
| r | 1552 | < 0.1% |
| k | 1552 | < 0.1% |
| d | 1552 | < 0.1% |
| b | 1552 | < 0.1% |
| o | 1552 | < 0.1% |
| t | 1552 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2124799 | |
| F | 1202992 | |
| N | 206101 | 5.5% |
| A | 206101 | 5.5% |
Space Separator
| Value | Count | Frequency (%) |
| 1552 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16138662 | |
| Common | 1552 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4530783 | |
| a | 3327791 | |
| l | 3326239 | |
| M | 2124799 | |
| F | 1202992 | 7.5% |
| m | 1202992 | 7.5% |
| N | 206101 | 1.3% |
| A | 206101 | 1.3% |
| r | 1552 | < 0.1% |
| k | 1552 | < 0.1% |
| Other values (5) | 7760 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1552 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16140214 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4530783 | |
| a | 3327791 | |
| l | 3326239 | |
| M | 2124799 | |
| F | 1202992 | 7.5% |
| m | 1202992 | 7.5% |
| N | 206101 | 1.3% |
| A | 206101 | 1.3% |
| r | 1552 | < 0.1% |
| k | 1552 | < 0.1% |
| Other values (6) | 9312 | 0.1% |
co_applicant
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| No co-applicant | |
|---|---|
| Co-applicant | |
| NA | 8614 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.645129 |
| Min length | 2 |
Characters and Unicode
| Total characters | 48220412 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No co-applicant |
|---|---|
| 2nd row | No co-applicant |
| 3rd row | Co-applicant |
| 4th row | No co-applicant |
| 5th row | No co-applicant |
Common Values
| Value | Count | Frequency (%) |
| No co-applicant | 1966616 | |
| Co-applicant | 1558662 | |
| NA | 8614 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| co-applicant | 3525278 | |
| no | 1966616 | |
| na | 8614 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7050556 | |
| p | 7050556 | |
| o | 5491894 | |
| c | 5491894 | |
| - | 3525278 | |
| l | 3525278 | |
| i | 3525278 | |
| n | 3525278 | |
| t | 3525278 | |
| N | 1975230 | 4.1% |
| Other values (3) | 3533892 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39186012 | |
| Uppercase Letter | 3542506 | 7.3% |
| Dash Punctuation | 3525278 | 7.3% |
| Space Separator | 1966616 | 4.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7050556 | |
| p | 7050556 | |
| o | 5491894 | |
| c | 5491894 | |
| l | 3525278 | |
| i | 3525278 | |
| n | 3525278 | |
| t | 3525278 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1975230 | |
| C | 1558662 | |
| A | 8614 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3525278 |
Space Separator
| Value | Count | Frequency (%) |
| 1966616 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42728518 | |
| Common | 5491894 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7050556 | |
| p | 7050556 | |
| o | 5491894 | |
| c | 5491894 | |
| l | 3525278 | |
| i | 3525278 | |
| n | 3525278 | |
| t | 3525278 | |
| N | 1975230 | 4.6% |
| C | 1558662 | 3.6% |
Common
| Value | Count | Frequency (%) |
| - | 3525278 | |
| 1966616 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48220412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7050556 | |
| p | 7050556 | |
| o | 5491894 | |
| c | 5491894 | |
| - | 3525278 | |
| l | 3525278 | |
| i | 3525278 | |
| n | 3525278 | |
| t | 3525278 | |
| N | 1975230 | 4.1% |
| Other values (3) | 3533892 |
age
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| 25 through 34 | |
|---|---|
| 35 through 44 | |
| 45 through 54 | |
| 55 through 64 | |
| Less than 25 | |
| Other values (3) |
Length
| Max length | 15 |
|---|---|
| Median length | 13 |
| Mean length | 12.972681 |
| Min length | 12 |
Characters and Unicode
| Total characters | 45844053 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 25 through 34 |
|---|---|
| 2nd row | 25 through 34 |
| 3rd row | 25 through 34 |
| 4th row | Less than 25 |
| 5th row | 25 through 34 |
Common Values
| Value | Count | Frequency (%) |
| 25 through 34 | 1134547 | |
| 35 through 44 | 951797 | |
| 45 through 54 | 615512 | |
| 55 through 64 | 398048 | 11.3% |
| Less than 25 | 195357 | 5.5% |
| 65 through 74 | 188633 | 5.3% |
| Greater than 74 | 48816 | 1.4% |
| Not Applicable | 1182 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| through | 3288537 | |
| 25 | 1329904 | |
| 34 | 1134547 | 10.7% |
| 35 | 951797 | 9.0% |
| 44 | 951797 | 9.0% |
| 45 | 615512 | 5.8% |
| 54 | 615512 | 5.8% |
| 55 | 398048 | 3.8% |
| 64 | 398048 | 3.8% |
| than | 244173 | 2.3% |
| Other values (6) | 672619 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7066602 | ||
| h | 6821247 | |
| 4 | 4904662 | |
| 5 | 4497454 | |
| t | 3582708 | |
| r | 3386169 | |
| o | 3289719 | |
| u | 3288537 | |
| g | 3288537 | |
| 3 | 2086344 | 4.6% |
| Other values (16) | 3632074 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24888420 | |
| Decimal Number | 13642494 | |
| Space Separator | 7066602 | 15.4% |
| Uppercase Letter | 246537 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 6821247 | |
| t | 3582708 | |
| r | 3386169 | |
| o | 3289719 | |
| u | 3288537 | |
| g | 3288537 | |
| s | 390714 | 1.6% |
| a | 294171 | 1.2% |
| e | 294171 | 1.2% |
| n | 244173 | 1.0% |
| Other values (5) | 8274 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 4904662 | |
| 5 | 4497454 | |
| 3 | 2086344 | |
| 2 | 1329904 | 9.7% |
| 6 | 586681 | 4.3% |
| 7 | 237449 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 195357 | |
| G | 48816 | 19.8% |
| N | 1182 | 0.5% |
| A | 1182 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 7066602 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25134957 | |
| Common | 20709096 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| h | 6821247 | |
| t | 3582708 | |
| r | 3386169 | |
| o | 3289719 | |
| u | 3288537 | |
| g | 3288537 | |
| s | 390714 | 1.6% |
| a | 294171 | 1.2% |
| e | 294171 | 1.2% |
| n | 244173 | 1.0% |
| Other values (9) | 254811 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 7066602 | ||
| 4 | 4904662 | |
| 5 | 4497454 | |
| 3 | 2086344 | 10.1% |
| 2 | 1329904 | 6.4% |
| 6 | 586681 | 2.8% |
| 7 | 237449 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45844053 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7066602 | ||
| h | 6821247 | |
| 4 | 4904662 | |
| 5 | 4497454 | |
| t | 3582708 | |
| r | 3386169 | |
| o | 3289719 | |
| u | 3288537 | |
| g | 3288537 | |
| 3 | 2086344 | 4.6% |
| Other values (16) | 3632074 |
income
Real number (ℝ)
SKEWED 
| Distinct | 3447 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 112.50465 |
| Minimum | 1 |
|---|---|
| Maximum | 365001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 32 |
| Q1 | 55 |
| median | 83 |
| Q3 | 128 |
| 95-th percentile | 270 |
| Maximum | 365001 |
| Range | 365000 |
| Interquartile range (IQR) | 73 |
Descriptive statistics
| Standard deviation | 298.97478 |
|---|---|
| Coefficient of variation (CV) | 2.6574438 |
| Kurtosis | 679755.11 |
| Mean | 112.50465 |
| Median Absolute Deviation (MAD) | 33 |
| Skewness | 636.55416 |
| Sum | 3.9757928 × 108 |
| Variance | 89385.921 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 44028 | 1.2% |
| 50 | 41831 | 1.2% |
| 52 | 39982 | 1.1% |
| 65 | 38563 | 1.1% |
| 55 | 38139 | 1.1% |
| 62 | 37488 | 1.1% |
| 70 | 36671 | 1.0% |
| 75 | 36475 | 1.0% |
| 48 | 36103 | 1.0% |
| 42 | 35587 | 1.0% |
| Other values (3437) | 3149025 |
| Value | Count | Frequency (%) |
| 1 | 510 | |
| 2 | 830 | |
| 3 | 1132 | |
| 4 | 1156 | |
| 5 | 1145 | |
| 6 | 1004 | |
| 7 | 896 | |
| 8 | 871 | |
| 9 | 880 | |
| 10 | 901 |
| Value | Count | Frequency (%) |
| 365001 | 1 | |
| 167923 | 1 | |
| 139000 | 1 | |
| 100000 | 1 | |
| 94000 | 1 | |
| 87360 | 1 | |
| 70844 | 1 | |
| 54000 | 1 | |
| 46560 | 1 | |
| 43316 | 1 |
loan_amount
Real number (ℝ)
SKEWED 
| Distinct | 668 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 288066.53 |
| Minimum | 5000 |
|---|---|
| Maximum | 1.106255 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.9 MiB |
Quantile statistics
| Minimum | 5000 |
|---|---|
| 5-th percentile | 85000 |
| Q1 | 155000 |
| median | 235000 |
| Q3 | 345000 |
| 95-th percentile | 645000 |
| Maximum | 1.106255 × 109 |
| Range | 1.10625 × 109 |
| Interquartile range (IQR) | 190000 |
Descriptive statistics
| Standard deviation | 672267.5 |
|---|---|
| Coefficient of variation (CV) | 2.3337231 |
| Kurtosis | 2111981.5 |
| Mean | 288066.53 |
| Median Absolute Deviation (MAD) | 90000 |
| Skewness | 1324.6788 |
| Sum | 1.017996 × 1012 |
| Variance | 4.5194359 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 205000 | 124758 | 3.5% |
| 165000 | 121505 | 3.4% |
| 155000 | 121333 | 3.4% |
| 175000 | 118551 | 3.4% |
| 185000 | 118524 | 3.4% |
| 225000 | 117437 | 3.3% |
| 195000 | 114156 | 3.2% |
| 215000 | 112379 | 3.2% |
| 145000 | 110573 | 3.1% |
| 135000 | 104821 | 3.0% |
| Other values (658) | 2369855 |
| Value | Count | Frequency (%) |
| 5000 | 1399 | < 0.1% |
| 15000 | 2342 | 0.1% |
| 25000 | 5220 | 0.1% |
| 35000 | 10049 | 0.3% |
| 45000 | 17307 | 0.5% |
| 55000 | 31686 | |
| 65000 | 40960 | |
| 75000 | 50689 | |
| 85000 | 59398 | |
| 95000 | 61616 |
| Value | Count | Frequency (%) |
| 1106255000 | 1 | |
| 410475000 | 1 | |
| 46025000 | 1 | |
| 29005000 | 1 | |
| 25005000 | 1 | |
| 24005000 | 1 | |
| 18005000 | 1 | |
| 17555000 | 1 | |
| 17005000 | 1 | |
| 16505000 | 1 |
property_value_ratio
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 11686 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 155646 |
| Missing (%) | 4.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.397305 |
| Minimum | 0.008 |
|---|---|
| Maximum | 12967.896 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.9 MiB |
Quantile statistics
| Minimum | 0.008 |
|---|---|
| 5-th percentile | 0.557 |
| Q1 | 0.886 |
| median | 1.174 |
| Q3 | 1.611 |
| 95-th percentile | 2.842 |
| Maximum | 12967.896 |
| Range | 12967.888 |
| Interquartile range (IQR) | 0.725 |
Descriptive statistics
| Standard deviation | 7.4203197 |
|---|---|
| Coefficient of variation (CV) | 5.3104512 |
| Kurtosis | 2769149.4 |
| Mean | 1.397305 |
| Median Absolute Deviation (MAD) | 0.339 |
| Skewness | 1604.2883 |
| Sum | 4720439.9 |
| Variance | 55.061145 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.134 | 6770 | 0.2% |
| 0.942 | 6707 | 0.2% |
| 1.057 | 6630 | 0.2% |
| 0.903 | 5988 | 0.2% |
| 1.018 | 5853 | 0.2% |
| 0.98 | 5744 | 0.2% |
| 0.994 | 5651 | 0.2% |
| 1.229 | 5611 | 0.2% |
| 1.318 | 5424 | 0.2% |
| 1.02 | 5322 | 0.2% |
| Other values (11676) | 3318546 | |
| (Missing) | 155646 | 4.4% |
| Value | Count | Frequency (%) |
| 0.008 | 1 | < 0.1% |
| 0.009 | 1 | < 0.1% |
| 0.01 | 1 | < 0.1% |
| 0.011 | 2 | < 0.1% |
| 0.013 | 3 | < 0.1% |
| 0.014 | 3 | < 0.1% |
| 0.016 | 2 | < 0.1% |
| 0.017 | 17 | |
| 0.018 | 1 | < 0.1% |
| 0.019 | 8 |
| Value | Count | Frequency (%) |
| 12967.896 | 1 | |
| 3010.999 | 1 | |
| 1832.974 | 1 | |
| 646.663 | 1 | |
| 486.491 | 1 | |
| 454.295 | 1 | |
| 418.116 | 1 | |
| 390.051 | 1 | |
| 336.373 | 1 | |
| 319.423 | 1 |
mortgage_term
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| 30 year mortgage | |
|---|---|
| Less than 30 years | 208431 |
| NA | 107144 |
| More than 30 years | 27462 |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 15.709038 |
| Min length | 2 |
Characters and Unicode
| Total characters | 55514042 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| 30 year mortgage | 3190855 | |
| Less than 30 years | 208431 | 5.9% |
| NA | 107144 | 3.0% |
| More than 30 years | 27462 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 30 | 3426748 | |
| year | 3190855 | |
| mortgage | 3190855 | |
| than | 235893 | 2.2% |
| years | 235893 | 2.2% |
| less | 208431 | 2.0% |
| na | 107144 | 1.0% |
| more | 27462 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7089389 | ||
| e | 6853496 | |
| a | 6853496 | |
| r | 6645065 | |
| g | 6381710 | |
| 3 | 3426748 | |
| 0 | 3426748 | |
| t | 3426748 | |
| y | 3426748 | |
| o | 3218317 | |
| Other values (8) | 4765577 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41120976 | |
| Space Separator | 7089389 | 12.8% |
| Decimal Number | 6853496 | 12.3% |
| Uppercase Letter | 450181 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6853496 | |
| a | 6853496 | |
| r | 6645065 | |
| g | 6381710 | |
| t | 3426748 | |
| y | 3426748 | |
| o | 3218317 | |
| m | 3190855 | |
| s | 652755 | 1.6% |
| h | 235893 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 208431 | |
| N | 107144 | |
| A | 107144 | |
| M | 27462 | 6.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3426748 | |
| 0 | 3426748 |
Space Separator
| Value | Count | Frequency (%) |
| 7089389 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41571157 | |
| Common | 13942885 | 25.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6853496 | |
| a | 6853496 | |
| r | 6645065 | |
| g | 6381710 | |
| t | 3426748 | |
| y | 3426748 | |
| o | 3218317 | |
| m | 3190855 | |
| s | 652755 | 1.6% |
| h | 235893 | 0.6% |
| Other values (5) | 686074 | 1.7% |
Common
| Value | Count | Frequency (%) |
| 7089389 | ||
| 3 | 3426748 | |
| 0 | 3426748 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55514042 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7089389 | ||
| e | 6853496 | |
| a | 6853496 | |
| r | 6645065 | |
| g | 6381710 | |
| 3 | 3426748 | |
| 0 | 3426748 | |
| t | 3426748 | |
| y | 3426748 | |
| o | 3218317 | |
| Other values (8) | 4765577 |
credit_model
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| Equifax | |
|---|---|
| TransUnion | |
| Experian | |
| NA | |
| More than one | |
| Other values (2) | 46273 |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 7.8729882 |
| Min length | 2 |
Characters and Unicode
| Total characters | 27822290 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| Equifax | 1080038 | |
| TransUnion | 1023869 | |
| Experian | 861470 | |
| NA | 354371 | 10.0% |
| More than one | 167871 | 4.8% |
| Other | 41701 | 1.2% |
| Vantage | 4572 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| equifax | 1080038 | |
| transunion | 1023869 | |
| experian | 861470 | |
| na | 354371 | 9.2% |
| more | 167871 | 4.3% |
| than | 167871 | 4.3% |
| one | 167871 | 4.3% |
| other | 41701 | 1.1% |
| vantage | 4572 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 4273391 | |
| a | 3142392 | |
| i | 2965377 | |
| r | 2094911 | 7.5% |
| E | 1941508 | 7.0% |
| x | 1941508 | 7.0% |
| o | 1359611 | 4.9% |
| e | 1243485 | 4.5% |
| u | 1080038 | 3.9% |
| q | 1080038 | 3.9% |
| Other values (14) | 6700031 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22574416 | |
| Uppercase Letter | 4912132 | 17.7% |
| Space Separator | 335742 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 4273391 | |
| a | 3142392 | |
| i | 2965377 | |
| r | 2094911 | |
| x | 1941508 | |
| o | 1359611 | 6.0% |
| e | 1243485 | 5.5% |
| u | 1080038 | 4.8% |
| q | 1080038 | 4.8% |
| f | 1080038 | 4.8% |
| Other values (5) | 2313627 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1941508 | |
| T | 1023869 | |
| U | 1023869 | |
| A | 354371 | 7.2% |
| N | 354371 | 7.2% |
| M | 167871 | 3.4% |
| O | 41701 | 0.8% |
| V | 4572 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 335742 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27486548 | |
| Common | 335742 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 4273391 | |
| a | 3142392 | |
| i | 2965377 | |
| r | 2094911 | 7.6% |
| E | 1941508 | 7.1% |
| x | 1941508 | 7.1% |
| o | 1359611 | 4.9% |
| e | 1243485 | 4.5% |
| u | 1080038 | 3.9% |
| q | 1080038 | 3.9% |
| Other values (13) | 6364289 |
Common
| Value | Count | Frequency (%) |
| 335742 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27822290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 4273391 | |
| a | 3142392 | |
| i | 2965377 | |
| r | 2094911 | 7.5% |
| E | 1941508 | 7.0% |
| x | 1941508 | 7.0% |
| o | 1359611 | 4.9% |
| e | 1243485 | 4.5% |
| u | 1080038 | 3.9% |
| q | 1080038 | 3.9% |
| Other values (14) | 6700031 |
debt_to_income_ratio
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| Healthy (<36%) | |
|---|---|
| Manageable (36-42%) | |
| Unmanageable (43-49%) | |
| Struggling (>50%) | |
| Exempt | 100070 |
Length
| Max length | 21 |
|---|---|
| Median length | 19 |
| Mean length | 16.811376 |
| Min length | 2 |
Characters and Unicode
| Total characters | 59409588 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Exempt |
|---|---|
| 2nd row | Exempt |
| 3rd row | Exempt |
| 4th row | Exempt |
| 5th row | Exempt |
Common Values
| Value | Count | Frequency (%) |
| Healthy (<36%) | 1359589 | |
| Manageable (36-42%) | 888042 | |
| Unmanageable (43-49%) | 822358 | |
| Struggling (>50%) | 326996 | 9.3% |
| Exempt | 100070 | 2.8% |
| NA | 36837 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| healthy | 1359589 | |
| 36 | 1359589 | |
| manageable | 888042 | |
| 36-42 | 888042 | |
| unmanageable | 822358 | |
| 43-49 | 822358 | |
| struggling | 326996 | 4.7% |
| 50 | 326996 | 4.7% |
| exempt | 100070 | 1.4% |
| na | 36837 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6490789 | 10.9% |
| e | 4880459 | 8.2% |
| l | 3396985 | 5.7% |
| 3396985 | 5.7% | |
| ( | 3396985 | 5.7% |
| % | 3396985 | 5.7% |
| ) | 3396985 | 5.7% |
| 3 | 3069989 | 5.2% |
| n | 2859754 | 4.8% |
| g | 2691388 | 4.5% |
| Other values (26) | 22432284 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28639164 | |
| Decimal Number | 10214770 | 17.2% |
| Uppercase Letter | 3570729 | 6.0% |
| Space Separator | 3396985 | 5.7% |
| Open Punctuation | 3396985 | 5.7% |
| Other Punctuation | 3396985 | 5.7% |
| Close Punctuation | 3396985 | 5.7% |
| Dash Punctuation | 1710400 | 2.9% |
| Math Symbol | 1686585 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6490789 | |
| e | 4880459 | |
| l | 3396985 | |
| n | 2859754 | |
| g | 2691388 | |
| t | 1786655 | 6.2% |
| b | 1710400 | 6.0% |
| y | 1359589 | 4.7% |
| h | 1359589 | 4.7% |
| m | 922428 | 3.2% |
| Other values (5) | 1181128 | 4.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3069989 | |
| 4 | 2532758 | |
| 6 | 2247631 | |
| 2 | 888042 | 8.7% |
| 9 | 822358 | 8.1% |
| 0 | 326996 | 3.2% |
| 5 | 326996 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1359589 | |
| M | 888042 | |
| U | 822358 | |
| S | 326996 | 9.2% |
| E | 100070 | 2.8% |
| N | 36837 | 1.0% |
| A | 36837 | 1.0% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 1359589 | |
| > | 326996 | 19.4% |
Space Separator
| Value | Count | Frequency (%) |
| 3396985 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3396985 |
Other Punctuation
| Value | Count | Frequency (%) |
| % | 3396985 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3396985 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1710400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32209893 | |
| Common | 27199695 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6490789 | |
| e | 4880459 | |
| l | 3396985 | |
| n | 2859754 | |
| g | 2691388 | |
| t | 1786655 | 5.5% |
| b | 1710400 | 5.3% |
| H | 1359589 | 4.2% |
| y | 1359589 | 4.2% |
| h | 1359589 | 4.2% |
| Other values (12) | 4314696 |
Common
| Value | Count | Frequency (%) |
| 3396985 | ||
| ( | 3396985 | |
| % | 3396985 | |
| ) | 3396985 | |
| 3 | 3069989 | |
| 4 | 2532758 | |
| 6 | 2247631 | |
| - | 1710400 | |
| < | 1359589 | |
| 2 | 888042 | 3.3% |
| Other values (4) | 1803346 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 59409588 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6490789 | 10.9% |
| e | 4880459 | 8.2% |
| l | 3396985 | 5.7% |
| 3396985 | 5.7% | |
| ( | 3396985 | 5.7% |
| % | 3396985 | 5.7% |
| ) | 3396985 | 5.7% |
| 3 | 3069989 | 5.2% |
| n | 2859754 | 4.8% |
| g | 2691388 | 4.5% |
| Other values (26) | 22432284 |
combined_loan_to_value_ratio
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 94184 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 188622 |
| Missing (%) | 5.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 114.10964 |
| Minimum | 0.2 |
|---|---|
| Maximum | 61224490 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.9 MiB |
Quantile statistics
| Minimum | 0.2 |
|---|---|
| 5-th percentile | 54.545 |
| Q1 | 80 |
| median | 90.323 |
| Q3 | 96.5 |
| 95-th percentile | 100 |
| Maximum | 61224490 |
| Range | 61224490 |
| Interquartile range (IQR) | 16.5 |
Descriptive statistics
| Standard deviation | 34791.559 |
|---|---|
| Coefficient of variation (CV) | 304.89587 |
| Kurtosis | 2875590.9 |
| Mean | 114.10964 |
| Median Absolute Deviation (MAD) | 6.677 |
| Skewness | 1654.3121 |
| Sum | 3.8172757 × 108 |
| Variance | 1.2104526 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 574948 | |
| 96.5 | 477727 | |
| 95 | 457784 | |
| 90 | 231637 | 6.6% |
| 97 | 222492 | 6.3% |
| 85 | 76875 | 2.2% |
| 100 | 70238 | 2.0% |
| 75 | 60473 | 1.7% |
| 70 | 28159 | 0.8% |
| 98.188 | 13998 | 0.4% |
| Other values (94174) | 1130939 | |
| (Missing) | 188622 | 5.3% |
| Value | Count | Frequency (%) |
| 0.2 | 2 | |
| 0.49 | 1 | |
| 0.52 | 1 | |
| 0.53 | 1 | |
| 0.58 | 1 | |
| 0.723 | 1 | |
| 0.76 | 1 | |
| 0.8 | 1 | |
| 0.81 | 1 | |
| 0.833 | 1 |
| Value | Count | Frequency (%) |
| 61224489.8 | 1 | < 0.1% |
| 13384132.93 | 1 | < 0.1% |
| 10400000 | 1 | < 0.1% |
| 3500000 | 1 | < 0.1% |
| 993446 | 1 | < 0.1% |
| 325785.714 | 1 | < 0.1% |
| 100000 | 2 | < 0.1% |
| 98189 | 6 | |
| 98000 | 1 | < 0.1% |
| 97749.875 | 1 | < 0.1% |
main_underwriter
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| Desktop Underwriter | |
|---|---|
| Loan Prospector | |
| Technology Open to Approved Lenders | |
| Not Applicable | |
| No main Aus | |
| Other values (2) | 88496 |
Length
| Max length | 35 |
|---|---|
| Median length | 19 |
| Mean length | 19.042627 |
| Min length | 5 |
Characters and Unicode
| Total characters | 67294587 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not Applicable |
|---|---|
| 2nd row | Not Applicable |
| 3rd row | Not Applicable |
| 4th row | Not Applicable |
| 5th row | Not Applicable |
Common Values
| Value | Count | Frequency (%) |
| Desktop Underwriter | 1857165 | |
| Loan Prospector | 488201 | 13.8% |
| Technology Open to Approved Lenders | 450317 | 12.7% |
| Not Applicable | 441373 | 12.5% |
| No main Aus | 208340 | 5.9% |
| Other | 88060 | 2.5% |
| Guaranteed Underwriting System | 436 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| desktop | 1857165 | |
| underwriter | 1857165 | |
| loan | 488201 | 5.7% |
| prospector | 488201 | 5.7% |
| technology | 450317 | 5.3% |
| open | 450317 | 5.3% |
| to | 450317 | 5.3% |
| approved | 450317 | 5.3% |
| lenders | 450317 | 5.3% |
| applicable | 441373 | 5.2% |
| Other values (8) | 1155761 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8842458 | |
| r | 7537899 | |
| o | 5772749 | 8.6% |
| t | 5183589 | 7.7% |
| 5005559 | 7.4% | |
| p | 4579063 | 6.8% |
| n | 3905965 | 5.8% |
| s | 3004459 | 4.5% |
| d | 2758671 | 4.1% |
| i | 2507750 | 3.7% |
| Other values (22) | 18196425 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54408234 | |
| Uppercase Letter | 7880794 | 11.7% |
| Space Separator | 5005559 | 7.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8842458 | |
| r | 7537899 | |
| o | 5772749 | |
| t | 5183589 | |
| p | 4579063 | |
| n | 3905965 | |
| s | 3004459 | 5.5% |
| d | 2758671 | 5.1% |
| i | 2507750 | 4.6% |
| w | 1857601 | 3.4% |
| Other values (11) | 8458030 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1857601 | |
| D | 1857165 | |
| A | 1100030 | |
| L | 938518 | |
| N | 649713 | 8.2% |
| O | 538377 | 6.8% |
| P | 488201 | 6.2% |
| T | 450317 | 5.7% |
| G | 436 | < 0.1% |
| S | 436 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 5005559 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 62289028 | |
| Common | 5005559 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8842458 | |
| r | 7537899 | |
| o | 5772749 | 9.3% |
| t | 5183589 | 8.3% |
| p | 4579063 | 7.4% |
| n | 3905965 | 6.3% |
| s | 3004459 | 4.8% |
| d | 2758671 | 4.4% |
| i | 2507750 | 4.0% |
| w | 1857601 | 3.0% |
| Other values (21) | 16338824 |
Common
| Value | Count | Frequency (%) |
| 5005559 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67294587 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8842458 | |
| r | 7537899 | |
| o | 5772749 | 8.6% |
| t | 5183589 | 7.7% |
| 5005559 | 7.4% | |
| p | 4579063 | 6.8% |
| n | 3905965 | 5.8% |
| s | 3004459 | 4.5% |
| d | 2758671 | 4.1% |
| i | 2507750 | 3.7% |
| Other values (22) | 18196425 |
tract_to_metro_income_percentage
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| Middle (80-120%) | |
|---|---|
| Upper (>120%) | |
| Moderate (50-80%) | |
| Low (<50%) | 83585 |
| NA | 28788 |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 14.712087 |
| Min length | 2 |
Characters and Unicode
| Total characters | 51990927 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Middle (80-120%) |
|---|---|
| 2nd row | Middle (80-120%) |
| 3rd row | Middle (80-120%) |
| 4th row | Middle (80-120%) |
| 5th row | Middle (80-120%) |
Common Values
| Value | Count | Frequency (%) |
| Middle (80-120%) | 1520918 | |
| Upper (>120%) | 1386851 | |
| Moderate (50-80%) | 513750 | 14.5% |
| Low (<50%) | 83585 | 2.4% |
| NA | 28788 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| middle | 1520918 | |
| 80-120 | 1520918 | |
| upper | 1386851 | |
| 120 | 1386851 | |
| moderate | 513750 | 7.3% |
| 50-80 | 513750 | 7.3% |
| low | 83585 | 1.2% |
| 50 | 83585 | 1.2% |
| na | 28788 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5539772 | 10.7% |
| e | 3935269 | 7.6% |
| d | 3555586 | 6.8% |
| ) | 3505104 | 6.7% |
| 3505104 | 6.7% | |
| ( | 3505104 | 6.7% |
| % | 3505104 | 6.7% |
| 1 | 2907769 | 5.6% |
| 2 | 2907769 | 5.6% |
| p | 2773702 | 5.3% |
| Other values (17) | 16350644 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16915414 | |
| Decimal Number | 13987313 | |
| Uppercase Letter | 3562680 | 6.9% |
| Close Punctuation | 3505104 | 6.7% |
| Space Separator | 3505104 | 6.7% |
| Open Punctuation | 3505104 | 6.7% |
| Other Punctuation | 3505104 | 6.7% |
| Dash Punctuation | 2034668 | 3.9% |
| Math Symbol | 1470436 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3935269 | |
| d | 3555586 | |
| p | 2773702 | |
| r | 1900601 | |
| i | 1520918 | 9.0% |
| l | 1520918 | 9.0% |
| o | 597335 | 3.5% |
| a | 513750 | 3.0% |
| t | 513750 | 3.0% |
| w | 83585 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5539772 | |
| 1 | 2907769 | |
| 2 | 2907769 | |
| 8 | 2034668 | 14.5% |
| 5 | 597335 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2034668 | |
| U | 1386851 | |
| L | 83585 | 2.3% |
| N | 28788 | 0.8% |
| A | 28788 | 0.8% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 1386851 | |
| < | 83585 | 5.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3505104 |
Space Separator
| Value | Count | Frequency (%) |
| 3505104 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3505104 |
Other Punctuation
| Value | Count | Frequency (%) |
| % | 3505104 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2034668 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 31512833 | |
| Latin | 20478094 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3935269 | |
| d | 3555586 | |
| p | 2773702 | |
| M | 2034668 | |
| r | 1900601 | |
| i | 1520918 | 7.4% |
| l | 1520918 | 7.4% |
| U | 1386851 | 6.8% |
| o | 597335 | 2.9% |
| a | 513750 | 2.5% |
| Other values (5) | 738496 | 3.6% |
Common
| Value | Count | Frequency (%) |
| 0 | 5539772 | |
| ) | 3505104 | |
| 3505104 | ||
| ( | 3505104 | |
| % | 3505104 | |
| 1 | 2907769 | |
| 2 | 2907769 | |
| - | 2034668 | 6.5% |
| 8 | 2034668 | 6.5% |
| > | 1386851 | 4.4% |
| Other values (2) | 680920 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51990927 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5539772 | 10.7% |
| e | 3935269 | 7.6% |
| d | 3555586 | 6.8% |
| ) | 3505104 | 6.7% |
| 3505104 | 6.7% | |
| ( | 3505104 | 6.7% |
| % | 3505104 | 6.7% |
| 1 | 2907769 | 5.6% |
| 2 | 2907769 | 5.6% |
| p | 2773702 | 5.3% |
| Other values (17) | 16350644 |
lender_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| Independent Mortgage Companies | |
|---|---|
| Banks | |
| Credit Union | |
| No definition | 10556 |
Length
| Max length | 30 |
|---|---|
| Median length | 30 |
| Mean length | 20.408673 |
| Min length | 5 |
Characters and Unicode
| Total characters | 72122048 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Banks |
|---|---|
| 2nd row | Banks |
| 3rd row | Banks |
| 4th row | Banks |
| 5th row | Banks |
Common Values
| Value | Count | Frequency (%) |
| Independent Mortgage Companies | 2099141 | |
| Banks | 1154250 | |
| Credit Union | 269945 | 7.6% |
| No definition | 10556 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| independent | 2099141 | |
| mortgage | 2099141 | |
| companies | 2099141 | |
| banks | 1154250 | |
| credit | 269945 | 3.4% |
| union | 269945 | 3.4% |
| no | 10556 | 0.1% |
| definition | 10556 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 10776206 | |
| n | 10111816 | |
| a | 5352532 | 7.4% |
| o | 4489339 | 6.2% |
| d | 4478783 | 6.2% |
| t | 4478783 | 6.2% |
| 4478783 | 6.2% | |
| g | 4198282 | 5.8% |
| p | 4198282 | 5.8% |
| s | 3253391 | 4.5% |
| Other values (11) | 16305851 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 59641146 | |
| Uppercase Letter | 8002119 | 11.1% |
| Space Separator | 4478783 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10776206 | |
| n | 10111816 | |
| a | 5352532 | |
| o | 4489339 | |
| d | 4478783 | |
| t | 4478783 | |
| g | 4198282 | 7.0% |
| p | 4198282 | 7.0% |
| s | 3253391 | 5.5% |
| i | 2670699 | 4.5% |
| Other values (4) | 5633033 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2369086 | |
| I | 2099141 | |
| M | 2099141 | |
| B | 1154250 | |
| U | 269945 | 3.4% |
| N | 10556 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4478783 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67643265 | |
| Common | 4478783 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10776206 | |
| n | 10111816 | |
| a | 5352532 | 7.9% |
| o | 4489339 | 6.6% |
| d | 4478783 | 6.6% |
| t | 4478783 | 6.6% |
| g | 4198282 | 6.2% |
| p | 4198282 | 6.2% |
| s | 3253391 | 4.8% |
| i | 2670699 | 3.9% |
| Other values (10) | 13635152 |
Common
| Value | Count | Frequency (%) |
| 4478783 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72122048 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 10776206 | |
| n | 10111816 | |
| a | 5352532 | 7.4% |
| o | 4489339 | 6.2% |
| d | 4478783 | 6.2% |
| t | 4478783 | 6.2% |
| 4478783 | 6.2% | |
| g | 4198282 | 5.8% |
| p | 4198282 | 5.8% |
| s | 3253391 | 4.5% |
| Other values (11) | 16305851 |
lender_size
Real number (ℝ)
| Distinct | 1984 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 149673.45 |
| Minimum | 1 |
|---|---|
| Maximum | 1026755 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 696 |
| Q1 | 5788 |
| median | 26178 |
| Q3 | 160012 |
| 95-th percentile | 774905 |
| Maximum | 1026755 |
| Range | 1026754 |
| Interquartile range (IQR) | 154224 |
Descriptive statistics
| Standard deviation | 248883.61 |
|---|---|
| Coefficient of variation (CV) | 1.6628441 |
| Kurtosis | 3.994445 |
| Mean | 149673.45 |
| Median Absolute Deviation (MAD) | 25011 |
| Skewness | 2.1595154 |
| Sum | 5.2892982 × 1011 |
| Variance | 6.1943053 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 410835 | 146341 | 4.1% |
| 774905 | 143112 | 4.0% |
| 1026755 | 117048 | 3.3% |
| 198516 | 83237 | 2.4% |
| 466552 | 68520 | 1.9% |
| 527621 | 68062 | 1.9% |
| 282102 | 64455 | 1.8% |
| 257847 | 50048 | 1.4% |
| 130400 | 46138 | 1.3% |
| 380650 | 41667 | 1.2% |
| Other values (1974) | 2705264 |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 2 | 3 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 6 | |
| 7 | 3 | < 0.1% |
| 8 | 8 | |
| 9 | 5 | |
| 10 | 10 | |
| 11 | 9 | |
| 12 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1026755 | 117048 | |
| 774905 | 143112 | |
| 527621 | 68062 | |
| 466552 | 68520 | |
| 410835 | 146341 | |
| 380650 | 41667 | 1.2% |
| 308884 | 20575 | 0.6% |
| 302784 | 8643 | 0.2% |
| 282102 | 64455 | |
| 257847 | 50048 | 1.4% |
white_population_pct
Real number (ℝ)
| Distinct | 70236 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 24986 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.847609 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 4183 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13.468089 |
| Q1 | 52.184386 |
| median | 74.042918 |
| Q3 | 86.895476 |
| 95-th percentile | 95.595182 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 34.71109 |
Descriptive statistics
| Standard deviation | 25.179598 |
|---|---|
| Coefficient of variation (CV) | 0.37667163 |
| Kurtosis | -0.090310374 |
| Mean | 66.847609 |
| Median Absolute Deviation (MAD) | 15.335483 |
| Skewness | -0.91371837 |
| Sum | 2.3456198 × 108 |
| Variance | 634.01214 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4183 | 0.1% |
| 25.08549218 | 1822 | 0.1% |
| 38.83451625 | 1639 | < 0.1% |
| 64.63691767 | 1429 | < 0.1% |
| 69.87282823 | 1224 | < 0.1% |
| 71.10172718 | 1213 | < 0.1% |
| 42.08020433 | 1202 | < 0.1% |
| 53.94154736 | 1202 | < 0.1% |
| 75.02294104 | 1170 | < 0.1% |
| 83.06857931 | 1079 | < 0.1% |
| Other values (70226) | 3492743 | |
| (Missing) | 24986 | 0.7% |
| Value | Count | Frequency (%) |
| 0 | 4183 | |
| 0.01163196464 | 15 | < 0.1% |
| 0.01282709082 | 3 | < 0.1% |
| 0.01891431814 | 85 | < 0.1% |
| 0.02134927412 | 1 | < 0.1% |
| 0.02161694769 | 2 | < 0.1% |
| 0.02297794118 | 15 | < 0.1% |
| 0.02919708029 | 4 | < 0.1% |
| 0.03082614057 | 16 | < 0.1% |
| 0.03355704698 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 823 | |
| 99.96020692 | 7 | < 0.1% |
| 99.95645548 | 31 | < 0.1% |
| 99.93152248 | 22 | < 0.1% |
| 99.92146597 | 22 | < 0.1% |
| 99.92142483 | 5 | < 0.1% |
| 99.91421218 | 36 | < 0.1% |
| 99.90821478 | 12 | < 0.1% |
| 99.89059081 | 21 | < 0.1% |
| 99.88502443 | 46 | < 0.1% |
metro_name
Text
MISSING 
| Distinct | 959 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 131511 |
| Missing (%) | 3.7% |
| Memory size | 53.9 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 36 |
| Mean length | 24.948878 |
| Min length | 7 |
Characters and Unicode
| Total characters | 84885590 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ames, IA |
|---|---|
| 2nd row | Mason City, IA |
| 3rd row | Mason City, IA |
| 4th row | Albert Lea, MN |
| 5th row | Ames, IA |
| Value | Count | Frequency (%) |
| tx | 310310 | 3.5% |
| ca | 302059 | 3.4% |
| fl | 283395 | 3.2% |
| ga | 116800 | 1.3% |
| city | 115106 | 1.3% |
| il | 114524 | 1.3% |
| pa | 112277 | 1.3% |
| new | 111538 | 1.2% |
| mi | 104344 | 1.2% |
| az | 103039 | 1.2% |
| Other values (1079) | 7261084 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6183985 | 7.3% |
| e | 5545254 | 6.5% |
| 5532095 | 6.5% | |
| n | 5378296 | 6.3% |
| o | 5047813 | 5.9% |
| - | 4672915 | 5.5% |
| r | 4107846 | 4.8% |
| l | 3725970 | 4.4% |
| i | 3663637 | 4.3% |
| t | 3459004 | 4.1% |
| Other values (52) | 37568775 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 53385226 | |
| Uppercase Letter | 17718898 | 20.9% |
| Space Separator | 5532095 | 6.5% |
| Dash Punctuation | 4672915 | 5.5% |
| Other Punctuation | 3576456 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6183985 | |
| e | 5545254 | |
| n | 5378296 | |
| o | 5047813 | |
| r | 4107846 | 7.7% |
| l | 3725970 | 7.0% |
| i | 3663637 | 6.9% |
| t | 3459004 | 6.5% |
| s | 2931410 | 5.5% |
| d | 1856888 | 3.5% |
| Other values (20) | 11485123 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1969155 | 11.1% |
| A | 1897913 | 10.7% |
| N | 1301318 | 7.3% |
| S | 1210554 | 6.8% |
| L | 1128171 | 6.4% |
| M | 1050505 | 5.9% |
| T | 815699 | 4.6% |
| P | 786605 | 4.4% |
| W | 779269 | 4.4% |
| B | 769387 | 4.3% |
| Other values (16) | 6010322 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3402381 | |
| . | 153416 | 4.3% |
| / | 17761 | 0.5% |
| ' | 2898 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 5532095 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4672915 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 71104124 | |
| Common | 13781466 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6183985 | 8.7% |
| e | 5545254 | 7.8% |
| n | 5378296 | 7.6% |
| o | 5047813 | 7.1% |
| r | 4107846 | 5.8% |
| l | 3725970 | 5.2% |
| i | 3663637 | 5.2% |
| t | 3459004 | 4.9% |
| s | 2931410 | 4.1% |
| C | 1969155 | 2.8% |
| Other values (46) | 29091754 |
Common
| Value | Count | Frequency (%) |
| 5532095 | ||
| - | 4672915 | |
| , | 3402381 | |
| . | 153416 | 1.1% |
| / | 17761 | 0.1% |
| ' | 2898 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 84877388 | |
| None | 8202 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6183985 | 7.3% |
| e | 5545254 | 6.5% |
| 5532095 | 6.5% | |
| n | 5378296 | 6.3% |
| o | 5047813 | 5.9% |
| - | 4672915 | 5.5% |
| r | 4107846 | 4.8% |
| l | 3725970 | 4.4% |
| i | 3663637 | 4.3% |
| t | 3459004 | 4.1% |
| Other values (48) | 37560573 |
None
| Value | Count | Frequency (%) |
| ó | 7303 | |
| ñ | 551 | 6.7% |
| á | 209 | 2.5% |
| ü | 139 | 1.7% |
metro_size_percentile
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| 90th percentile | |
|---|---|
| 80th percentile | |
| 99th percentile | |
| 70th percentile | |
| Micro area | |
| Other values (7) |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.667116 |
| Min length | 10 |
Characters and Unicode
| Total characters | 51832003 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0th percentile |
|---|---|
| 2nd row | 0th percentile |
| 3rd row | 10th percentile |
| 4th row | 0th percentile |
| 5th row | 0th percentile |
Common Values
| Value | Count | Frequency (%) |
| 90th percentile | 1252242 | |
| 80th percentile | 583803 | |
| 99th percentile | 371328 | 10.5% |
| 70th percentile | 348534 | 9.9% |
| Micro area | 202004 | 5.7% |
| 60th percentile | 195259 | 5.5% |
| 0th percentile | 166357 | 4.7% |
| 50th percentile | 133795 | 3.8% |
| 40th percentile | 97165 | 2.7% |
| 30th percentile | 78160 | 2.2% |
| Other values (2) | 105245 | 3.0% |
Length
| Value | Count | Frequency (%) |
| percentile | 3331888 | |
| 90th | 1252242 | 17.7% |
| 80th | 583803 | 8.3% |
| 99th | 371328 | 5.3% |
| 70th | 348534 | 4.9% |
| micro | 202004 | 2.9% |
| area | 202004 | 2.9% |
| 60th | 195259 | 2.8% |
| 0th | 166357 | 2.4% |
| 50th | 133795 | 1.9% |
| Other values (4) | 280570 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 10197668 | |
| t | 6663776 | |
| r | 3735896 | 7.2% |
| 3533892 | 6.8% | |
| c | 3533892 | 6.8% |
| i | 3533892 | 6.8% |
| l | 3331888 | 6.4% |
| h | 3331888 | 6.4% |
| p | 3331888 | 6.4% |
| n | 3331888 | 6.4% |
| Other values (13) | 7305435 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41598688 | |
| Decimal Number | 6497419 | 12.5% |
| Space Separator | 3533892 | 6.8% |
| Uppercase Letter | 202004 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10197668 | |
| t | 6663776 | |
| r | 3735896 | 9.0% |
| c | 3533892 | 8.5% |
| i | 3533892 | 8.5% |
| l | 3331888 | 8.0% |
| h | 3331888 | 8.0% |
| p | 3331888 | 8.0% |
| n | 3331888 | 8.0% |
| a | 404008 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2960560 | |
| 9 | 1994898 | |
| 8 | 583803 | 9.0% |
| 7 | 348534 | 5.4% |
| 6 | 195259 | 3.0% |
| 5 | 133795 | 2.1% |
| 4 | 97165 | 1.5% |
| 3 | 78160 | 1.2% |
| 2 | 58483 | 0.9% |
| 1 | 46762 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 3533892 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 202004 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41800692 | |
| Common | 10031311 | 19.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10197668 | |
| t | 6663776 | |
| r | 3735896 | 8.9% |
| c | 3533892 | 8.5% |
| i | 3533892 | 8.5% |
| l | 3331888 | 8.0% |
| h | 3331888 | 8.0% |
| p | 3331888 | 8.0% |
| n | 3331888 | 8.0% |
| a | 404008 | 1.0% |
| Other values (2) | 404008 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 3533892 | ||
| 0 | 2960560 | |
| 9 | 1994898 | |
| 8 | 583803 | 5.8% |
| 7 | 348534 | 3.5% |
| 6 | 195259 | 1.9% |
| 5 | 133795 | 1.3% |
| 4 | 97165 | 1.0% |
| 3 | 78160 | 0.8% |
| 2 | 58483 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51832003 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 10197668 | |
| t | 6663776 | |
| r | 3735896 | 7.2% |
| 3533892 | 6.8% | |
| c | 3533892 | 6.8% |
| i | 3533892 | 6.8% |
| l | 3331888 | 6.4% |
| h | 3331888 | 6.4% |
| p | 3331888 | 6.4% |
| n | 3331888 | 6.4% |
| Other values (13) | 7305435 |
state_code
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 24438 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.016897 |
| Minimum | 1 |
|---|---|
| Maximum | 72 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 12 |
| median | 27 |
| Q3 | 42 |
| 95-th percentile | 53 |
| Maximum | 72 |
| Range | 71 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 16.28383 |
|---|---|
| Coefficient of variation (CV) | 0.58121464 |
| Kurtosis | -1.2835181 |
| Mean | 28.016897 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.054109569 |
| Sum | 98324010 |
| Variance | 265.16313 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48 | 319004 | 9.0% |
| 6 | 304379 | 8.6% |
| 12 | 284738 | 8.1% |
| 17 | 136024 | 3.8% |
| 39 | 132836 | 3.8% |
| 36 | 130346 | 3.7% |
| 13 | 128627 | 3.6% |
| 42 | 124632 | 3.5% |
| 37 | 123998 | 3.5% |
| 26 | 111038 | 3.1% |
| Other values (42) | 1713832 |
| Value | Count | Frequency (%) |
| 1 | 47561 | 1.3% |
| 2 | 6046 | 0.2% |
| 4 | 103213 | 2.9% |
| 5 | 26752 | 0.8% |
| 6 | 304379 | |
| 8 | 89114 | 2.5% |
| 9 | 37986 | 1.1% |
| 10 | 11552 | 0.3% |
| 11 | 7693 | 0.2% |
| 12 | 284738 |
| Value | Count | Frequency (%) |
| 72 | 9058 | 0.3% |
| 56 | 5723 | 0.2% |
| 55 | 66622 | 1.9% |
| 54 | 12450 | 0.4% |
| 53 | 95381 | 2.7% |
| 51 | 93976 | 2.7% |
| 50 | 5331 | 0.2% |
| 49 | 49623 | 1.4% |
| 48 | 319004 | |
| 47 | 80198 | 2.3% |
county_code
Real number (ℝ)
| Distinct | 321 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 24438 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87.691435 |
| Minimum | 1 |
|---|---|
| Maximum | 840 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 29 |
| median | 65 |
| Q3 | 111 |
| 95-th percentile | 217 |
| Maximum | 840 |
| Range | 839 |
| Interquartile range (IQR) | 82 |
Descriptive statistics
| Standard deviation | 98.908476 |
|---|---|
| Coefficient of variation (CV) | 1.1279149 |
| Kurtosis | 14.265155 |
| Mean | 87.691435 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 3.2115213 |
| Sum | 3.0774906 × 108 |
| Variance | 9782.8867 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 123326 | 3.5% |
| 3 | 113539 | 3.2% |
| 31 | 113474 | 3.2% |
| 37 | 87428 | 2.5% |
| 1 | 83268 | 2.4% |
| 59 | 70644 | 2.0% |
| 5 | 70375 | 2.0% |
| 29 | 63508 | 1.8% |
| 35 | 62245 | 1.8% |
| 11 | 61390 | 1.7% |
| Other values (311) | 2660257 |
| Value | Count | Frequency (%) |
| 1 | 83268 | |
| 3 | 113539 | |
| 5 | 70375 | |
| 6 | 46 | < 0.1% |
| 7 | 35986 | 1.0% |
| 9 | 44933 | 1.3% |
| 11 | 61390 | |
| 12 | 52 | < 0.1% |
| 13 | 123326 | |
| 14 | 1290 | < 0.1% |
| Value | Count | Frequency (%) |
| 840 | 224 | < 0.1% |
| 830 | 112 | < 0.1% |
| 820 | 355 | < 0.1% |
| 810 | 4444 | |
| 800 | 980 | < 0.1% |
| 790 | 333 | < 0.1% |
| 775 | 263 | < 0.1% |
| 770 | 1087 | < 0.1% |
| 760 | 2455 | |
| 750 | 72 | < 0.1% |
census_tract
Text
| Distinct | 120156 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.664663 |
| Min length | 2 |
Characters and Unicode
| Total characters | 44755551 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17947 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 19081270100.0 |
|---|---|
| 2nd row | 19081270200.0 |
| 3rd row | 19169010600.0 |
| 4th row | 19081270100.0 |
| 5th row | 19081270100.0 |
| Value | Count | Frequency (%) |
| nan | 24890 | 0.7% |
| 48157672900.0 | 1722 | < 0.1% |
| 48201542900.0 | 1591 | < 0.1% |
| 48157673200.0 | 1371 | < 0.1% |
| 48439114103.0 | 1177 | < 0.1% |
| 48085030203.0 | 1161 | < 0.1% |
| 48157673400.0 | 1152 | < 0.1% |
| 48157673101.0 | 1145 | < 0.1% |
| 48085030305.0 | 1127 | < 0.1% |
| 48329010112.0 | 1024 | < 0.1% |
| Other values (120146) | 3497532 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15730394 | |
| 1 | 6134505 | 13.7% |
| 3 | 3534557 | 7.9% |
| 2 | 3491795 | 7.8% |
| . | 3344435 | 7.5% |
| 5 | 2581830 | 5.8% |
| 4 | 2509643 | 5.6% |
| 7 | 2064897 | 4.6% |
| 9 | 1988150 | 4.4% |
| 6 | 1791135 | 4.0% |
| Other values (4) | 1584210 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 41336444 | |
| Other Punctuation | 3344435 | 7.5% |
| Lowercase Letter | 74671 | 0.2% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15730394 | |
| 1 | 6134505 | 14.8% |
| 3 | 3534557 | 8.6% |
| 2 | 3491795 | 8.4% |
| 5 | 2581830 | 6.2% |
| 4 | 2509643 | 6.1% |
| 7 | 2064897 | 5.0% |
| 9 | 1988150 | 4.8% |
| 6 | 1791135 | 4.3% |
| 8 | 1509538 | 3.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 49780 | |
| a | 24891 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3344435 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 44680879 | |
| Latin | 74672 | 0.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15730394 | |
| 1 | 6134505 | 13.7% |
| 3 | 3534557 | 7.9% |
| 2 | 3491795 | 7.8% |
| . | 3344435 | 7.5% |
| 5 | 2581830 | 5.8% |
| 4 | 2509643 | 5.6% |
| 7 | 2064897 | 4.6% |
| 9 | 1988150 | 4.4% |
| 6 | 1791135 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| n | 49780 | |
| a | 24891 | |
| N | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44755551 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15730394 | |
| 1 | 6134505 | 13.7% |
| 3 | 3534557 | 7.9% |
| 2 | 3491795 | 7.8% |
| . | 3344435 | 7.5% |
| 5 | 2581830 | 5.8% |
| 4 | 2509643 | 5.6% |
| 7 | 2064897 | 4.6% |
| 9 | 1988150 | 4.4% |
| 6 | 1791135 | 4.0% |
| Other values (4) | 1584210 | 3.5% |
activity_year
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| 2019 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 14135568 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019 |
|---|---|
| 2nd row | 2019 |
| 3rd row | 2019 |
| 4th row | 2019 |
| 5th row | 2019 |
Common Values
| Value | Count | Frequency (%) |
| 2019 | 3533892 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2019 | 3533892 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3533892 | |
| 0 | 3533892 | |
| 1 | 3533892 | |
| 9 | 3533892 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14135568 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3533892 | |
| 0 | 3533892 | |
| 1 | 3533892 | |
| 9 | 3533892 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14135568 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3533892 | |
| 0 | 3533892 | |
| 1 | 3533892 | |
| 9 | 3533892 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14135568 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3533892 | |
| 0 | 3533892 | |
| 1 | 3533892 | |
| 9 | 3533892 |
loan_outcome
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
| Loan originated (approved) | |
|---|---|
| Loan denied | 318170 |
Length
| Max length | 26 |
|---|---|
| Median length | 26 |
| Mean length | 24.649492 |
| Min length | 11 |
Characters and Unicode
| Total characters | 87108642 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Loan originated (approved) |
|---|---|
| 2nd row | Loan originated (approved) |
| 3rd row | Loan originated (approved) |
| 4th row | Loan originated (approved) |
| 5th row | Loan originated (approved) |
Common Values
| Value | Count | Frequency (%) |
| Loan originated (approved) | 3215722 | |
| Loan denied | 318170 | 9.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| loan | 3533892 | |
| originated | 3215722 | |
| approved | 3215722 | |
| denied | 318170 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 9965336 | |
| a | 9965336 | |
| n | 7067784 | |
| e | 7067784 | |
| d | 7067784 | |
| 6749614 | ||
| i | 6749614 | |
| r | 6431444 | 7.4% |
| p | 6431444 | 7.4% |
| L | 3533892 | 4.1% |
| Other values (5) | 16078610 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70393692 | |
| Space Separator | 6749614 | 7.7% |
| Uppercase Letter | 3533892 | 4.1% |
| Open Punctuation | 3215722 | 3.7% |
| Close Punctuation | 3215722 | 3.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 9965336 | |
| a | 9965336 | |
| n | 7067784 | |
| e | 7067784 | |
| d | 7067784 | |
| i | 6749614 | |
| r | 6431444 | |
| p | 6431444 | |
| g | 3215722 | 4.6% |
| t | 3215722 | 4.6% |
Space Separator
| Value | Count | Frequency (%) |
| 6749614 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 3533892 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3215722 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3215722 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 73927584 | |
| Common | 13181058 | 15.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 9965336 | |
| a | 9965336 | |
| n | 7067784 | |
| e | 7067784 | |
| d | 7067784 | |
| i | 6749614 | |
| r | 6431444 | |
| p | 6431444 | |
| L | 3533892 | 4.8% |
| g | 3215722 | 4.3% |
| Other values (2) | 6431444 |
Common
| Value | Count | Frequency (%) |
| 6749614 | ||
| ( | 3215722 | |
| ) | 3215722 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 87108642 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 9965336 | |
| a | 9965336 | |
| n | 7067784 | |
| e | 7067784 | |
| d | 7067784 | |
| 6749614 | ||
| i | 6749614 | |
| r | 6431444 | 7.4% |
| p | 6431444 | 7.4% |
| L | 3533892 | 4.1% |
| Other values (5) | 16078610 |
lender_id
Text
| Distinct | 5101 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.9 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Characters and Unicode
| Total characters | 70677840 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 100 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 25490003YGASV5ENH153 |
|---|---|
| 2nd row | 25490003YGASV5ENH153 |
| 3rd row | 25490003YGASV5ENH153 |
| 4th row | 25490003YGASV5ENH153 |
| 5th row | 25490003YGASV5ENH153 |
| Value | Count | Frequency (%) |
| 549300hw662mn1wu8550 | 146341 | 4.1% |
| 549300fgxn1k3hlb1r50 | 143112 | 4.0% |
| kb1h1dsprfmymcufxt09 | 117048 | 3.3% |
| 549300mgpzblqdil7538 | 83237 | 2.4% |
| b4tydeb6gkmzo031mb27 | 68520 | 1.9% |
| 7h6glxdrugqfu57rne97 | 68062 | 1.9% |
| 549300j7xkt2bi5wx213 | 64455 | 1.8% |
| 549300ag64nhilb7zp05 | 50048 | 1.4% |
| 549300u3721pjgqzyy68 | 46138 | 1.3% |
| 6byl5qzybdk8s7l73m02 | 41667 | 1.2% |
| Other values (5091) | 2705264 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7769172 | 11.0% |
| 5 | 5023608 | 7.1% |
| 3 | 4597868 | 6.5% |
| 4 | 4492322 | 6.4% |
| 9 | 3989064 | 5.6% |
| 1 | 2588454 | 3.7% |
| 2 | 2217796 | 3.1% |
| 7 | 2073209 | 2.9% |
| 6 | 2040331 | 2.9% |
| 8 | 1685160 | 2.4% |
| Other values (26) | 34200856 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36476984 | |
| Uppercase Letter | 34200856 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1643802 | 4.8% |
| H | 1614982 | 4.7% |
| M | 1609071 | 4.7% |
| N | 1568954 | 4.6% |
| D | 1565974 | 4.6% |
| R | 1565240 | 4.6% |
| W | 1481396 | 4.3% |
| S | 1462324 | 4.3% |
| K | 1440948 | 4.2% |
| L | 1440661 | 4.2% |
| Other values (16) | 18807504 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7769172 | |
| 5 | 5023608 | |
| 3 | 4597868 | |
| 4 | 4492322 | |
| 9 | 3989064 | |
| 1 | 2588454 | 7.1% |
| 2 | 2217796 | 6.1% |
| 7 | 2073209 | 5.7% |
| 6 | 2040331 | 5.6% |
| 8 | 1685160 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 36476984 | |
| Latin | 34200856 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 1643802 | 4.8% |
| H | 1614982 | 4.7% |
| M | 1609071 | 4.7% |
| N | 1568954 | 4.6% |
| D | 1565974 | 4.6% |
| R | 1565240 | 4.6% |
| W | 1481396 | 4.3% |
| S | 1462324 | 4.3% |
| K | 1440948 | 4.2% |
| L | 1440661 | 4.2% |
| Other values (16) | 18807504 |
Common
| Value | Count | Frequency (%) |
| 0 | 7769172 | |
| 5 | 5023608 | |
| 3 | 4597868 | |
| 4 | 4492322 | |
| 9 | 3989064 | |
| 1 | 2588454 | 7.1% |
| 2 | 2217796 | 6.1% |
| 7 | 2073209 | 5.7% |
| 6 | 2040331 | 5.6% |
| 8 | 1685160 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70677840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7769172 | 11.0% |
| 5 | 5023608 | 7.1% |
| 3 | 4597868 | 6.5% |
| 4 | 4492322 | 6.4% |
| 9 | 3989064 | 5.6% |
| 1 | 2588454 | 3.7% |
| 2 | 2217796 | 3.1% |
| 7 | 2073209 | 2.9% |
| 6 | 2040331 | 2.9% |
| 8 | 1685160 | 2.4% |
| Other values (26) | 34200856 |
| race | sex | co_applicant | age | income | loan_amount | property_value_ratio | mortgage_term | credit_model | debt_to_income_ratio | combined_loan_to_value_ratio | main_underwriter | tract_to_metro_income_percentage | lender_type | lender_size | white_population_pct | metro_name | metro_size_percentile | state_code | county_code | census_tract | activity_year | loan_outcome | lender_id | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | White | Female | No co-applicant | 25 through 34 | 23.0 | 25000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 94.487578 | NaN | 0th percentile | 19.0 | 81.0 | 19081270100.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 1 | White | Male | No co-applicant | 25 through 34 | 42.0 | 85000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 93.845535 | NaN | 0th percentile | 19.0 | 81.0 | 19081270200.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 2 | White | Male | Co-applicant | 25 through 34 | 125.0 | 95000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 96.340348 | Ames, IA | 10th percentile | 19.0 | 169.0 | 19169010600.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 3 | White | Male | No co-applicant | Less than 25 | 34.0 | 75000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 94.487578 | NaN | 0th percentile | 19.0 | 81.0 | 19081270100.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 4 | White | Female | No co-applicant | 25 through 34 | 37.0 | 145000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 94.487578 | NaN | 0th percentile | 19.0 | 81.0 | 19081270100.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 5 | White | Female | Co-applicant | 45 through 54 | 57.0 | 75000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 92.326835 | NaN | 0th percentile | 19.0 | 81.0 | 19081270300.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 6 | White | Male | No co-applicant | Less than 25 | 27.0 | 65000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 93.845535 | NaN | 0th percentile | 19.0 | 81.0 | 19081270200.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 7 | White | Male | No co-applicant | Less than 25 | 32.0 | 75000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 94.487578 | NaN | 0th percentile | 19.0 | 81.0 | 19081270100.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 8 | White | Male | No co-applicant | 25 through 34 | 35.0 | 25000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 93.845535 | NaN | 0th percentile | 19.0 | 81.0 | 19081270200.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| 9 | White | Male | Co-applicant | 35 through 44 | 123.0 | 175000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 107 | 92.326835 | NaN | 0th percentile | 19.0 | 81.0 | 19081270300.0 | 2019 | Loan originated (approved) | 25490003YGASV5ENH153 |
| race | sex | co_applicant | age | income | loan_amount | property_value_ratio | mortgage_term | credit_model | debt_to_income_ratio | combined_loan_to_value_ratio | main_underwriter | tract_to_metro_income_percentage | lender_type | lender_size | white_population_pct | metro_name | metro_size_percentile | state_code | county_code | census_tract | activity_year | loan_outcome | lender_id | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4456601 | Race NA | NA | No co-applicant | 35 through 44 | 487.0 | 1425000 | 3.565 | 30 year mortgage | Experian | Healthy (<36%) | 85.000 | Not Applicable | Upper (>120%) | Independent Mortgage Companies | 1894 | 70.298235 | Cambridge-Newton-Framingham, MA | 90th percentile | 25.0 | 17.0 | 25017359100.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456602 | Race NA | NA | Co-applicant | 25 through 34 | 56.0 | 215000 | 0.646 | 30 year mortgage | NA | Manageable (36-42%) | 80.000 | Not Applicable | Low (<50%) | Independent Mortgage Companies | 1894 | 28.293474 | Cambridge-Newton-Framingham, MA | 90th percentile | 25.0 | 9.0 | 25009207200.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456603 | Race NA | NA | No co-applicant | 35 through 44 | 185.0 | 725000 | 1.823 | 30 year mortgage | Experian | Manageable (36-42%) | 80.000 | Not Applicable | Middle (80-120%) | Independent Mortgage Companies | 1894 | 66.493169 | Boston, MA | 90th percentile | 25.0 | 25.0 | 25025000301.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456604 | White | Male | No co-applicant | 55 through 64 | 300.0 | 1275000 | 4.244 | 30 year mortgage | NA | Manageable (36-42%) | 60.000 | Not Applicable | Upper (>120%) | Independent Mortgage Companies | 1894 | 53.057369 | Cambridge-Newton-Framingham, MA | 90th percentile | 25.0 | 17.0 | 25017358300.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456607 | Race NA | Male | No co-applicant | 45 through 54 | 947.0 | 1495000 | 7.596 | 30 year mortgage | NA | Healthy (<36%) | 46.123 | Not Applicable | Upper (>120%) | Independent Mortgage Companies | 1894 | 79.474940 | Bridgeport-Stamford-Norwalk, CT | 80th percentile | 9.0 | 1.0 | 9001011100.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456608 | White | Female | Co-applicant | 55 through 64 | 196.0 | 375000 | 0.929 | 30 year mortgage | NA | Healthy (<36%) | 80.000 | Desktop Underwriter | Middle (80-120%) | Independent Mortgage Companies | 1894 | 80.891304 | Cambridge-Newton-Framingham, MA | 90th percentile | 25.0 | 17.0 | 25017317102.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456609 | Race NA | NA | Co-applicant | 55 through 64 | 68.0 | 315000 | 1.618 | 30 year mortgage | TransUnion | Unmanageable (43-49%) | 65.235 | Not Applicable | Upper (>120%) | Independent Mortgage Companies | 1894 | 89.135066 | Providence-Warwick, RI-MA | 80th percentile | 25.0 | 5.0 | 25005631700.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456611 | White | Male | Co-applicant | 35 through 44 | 365.0 | 865000 | 2.283 | 30 year mortgage | NA | Unmanageable (43-49%) | 80.000 | Not Applicable | Upper (>120%) | Independent Mortgage Companies | 1894 | 93.231994 | Boston, MA | 90th percentile | 25.0 | 21.0 | 25021409102.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456612 | Race NA | NA | No co-applicant | 45 through 54 | 25.0 | 85000 | 0.339 | 30 year mortgage | TransUnion | Unmanageable (43-49%) | 90.000 | Loan Prospector | Low (<50%) | Independent Mortgage Companies | 1894 | 48.053528 | Worcester, MA-CT | 80th percentile | 25.0 | 27.0 | 25027710700.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
| 4456613 | White | Female | Co-applicant | 25 through 34 | 318.0 | 685000 | 3.047 | 30 year mortgage | TransUnion | Healthy (<36%) | 95.000 | Not Applicable | Upper (>120%) | Independent Mortgage Companies | 1894 | 92.470277 | Worcester, MA-CT | 80th percentile | 25.0 | 27.0 | 25027715100.0 | 2019 | Loan originated (approved) | 549300L0OVX5O63S8C68 |
Most frequently occurring
| race | sex | co_applicant | age | income | loan_amount | property_value_ratio | mortgage_term | credit_model | debt_to_income_ratio | combined_loan_to_value_ratio | main_underwriter | tract_to_metro_income_percentage | lender_type | lender_size | white_population_pct | metro_name | metro_size_percentile | state_code | county_code | census_tract | activity_year | loan_outcome | lender_id | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 69 | Black | Male | No co-applicant | 45 through 54 | 90.0 | 135000 | 0.937 | 30 year mortgage | TransUnion | Healthy (<36%) | 90.000 | Technology Open to Approved Lenders | Middle (80-120%) | Independent Mortgage Companies | 257847 | 75.824411 | Pittsburgh, PA | 90th percentile | 42.0 | 3.0 | 42003523800 | 2019 | Loan denied | 549300AG64NHILB7ZP05 | 3 |
| 101 | Latino | Female | No co-applicant | 45 through 54 | 180.0 | 575000 | 1.126 | 30 year mortgage | TransUnion | Struggling (>50%) | 75.000 | Not Applicable | Upper (>120%) | Independent Mortgage Companies | 8613 | 60.801936 | Anaheim-Santa Ana-Irvine, CA | 90th percentile | 6.0 | 59.0 | 6059032034.0 | 2019 | Loan denied | 254900E6AIE4Z8YQM970 | 3 |
| 249 | White | Female | No co-applicant | Greater than 74 | 20.0 | 115000 | NaN | NA | NA | NA | NaN | Not Applicable | Middle (80-120%) | Independent Mortgage Companies | 5786 | 78.602620 | Forest City, NC | Micro area | 37.0 | 161.0 | 37161960900.0 | 2019 | Loan denied | 549300QUX3LK82LO3013 | 3 |
| 0 | Asian | Female | Co-applicant | 25 through 34 | 300.0 | 615000 | 1.289 | Less than 30 years | Equifax | Healthy (<36%) | 94.923 | Desktop Underwriter | Upper (>120%) | Independent Mortgage Companies | 68908 | 60.115875 | Washington-Arlington-Alexandria, DC-VA-MD-WV | 90th percentile | 51.0 | 107.0 | 51107611014.0 | 2019 | Loan denied | 549300YIQ7S7Z8PIHE53 | 2 |
| 1 | Asian | Female | Co-applicant | 35 through 44 | 180.0 | 1375000 | 2.941 | 30 year mortgage | Experian | Struggling (>50%) | 80.000 | Other | Upper (>120%) | Banks | 1026755 | 20.286396 | Los Angeles-Long Beach-Glendale, CA | 99th percentile | 6.0 | 37.0 | 6037431600.0 | 2019 | Loan denied | KB1H1DSPRFMYMCUFXT09 | 2 |
| 2 | Asian | Female | No co-applicant | 35 through 44 | 99.0 | 485000 | 1.113 | 30 year mortgage | TransUnion | Unmanageable (43-49%) | 80.000 | Not Applicable | Middle (80-120%) | Banks | 160012 | 24.318489 | New York-Jersey City-White Plains, NY-NJ | 99th percentile | 36.0 | 81.0 | 36081074700.0 | 2019 | Loan denied | AD6GFRVSDT01YPT1CS68 | 2 |
| 3 | Asian | Female | No co-applicant | 35 through 44 | 350.0 | 265000 | 1.151 | 30 year mortgage | TransUnion | Healthy (<36%) | 75.000 | Not Applicable | Upper (>120%) | Independent Mortgage Companies | 6277 | 88.827434 | Chicago-Naperville-Evanston, IL | 99th percentile | 17.0 | 43.0 | 17043844901.0 | 2019 | Loan denied | 549300EHQ0Y7SP41BR91 | 2 |
| 4 | Asian | Female | No co-applicant | 45 through 54 | 42.0 | 135000 | NaN | NA | NA | Exempt | NaN | Not Applicable | Middle (80-120%) | Banks | 218 | 84.139016 | Ocala, FL | 50th percentile | 12.0 | 83.0 | 12083002702.0 | 2019 | Loan originated (approved) | 254900G3JF710WUIHN65 | 2 |
| 5 | Asian | Female | No co-applicant | 55 through 64 | 480.0 | 305000 | 2.036 | 30 year mortgage | More than one | Healthy (<36%) | 30.000 | Not Applicable | Upper (>120%) | Banks | 31629 | 37.842324 | Nassau County-Suffolk County, NY | 90th percentile | 36.0 | 59.0 | 36059303101.0 | 2019 | Loan denied | TR24TWEY5RVRQV65HD49 | 2 |
| 6 | Asian | Male | Co-applicant | 25 through 34 | 185.0 | 15000 | 0.903 | 30 year mortgage | More than one | Healthy (<36%) | 102.945 | Not Applicable | Middle (80-120%) | Independent Mortgage Companies | 3514 | 79.897910 | Oakland-Berkeley-Livermore, CA | 90th percentile | 6.0 | 13.0 | 6013304004.0 | 2019 | Loan originated (approved) | 5493008E4KBJCB6UKR64 | 2 |